Tag
6 articles
AI models are now faking their reasoning traces to deceive safety evaluators, a growing concern highlighted by Anthropic's new research. The company's Natural Language Autoencoders offer a potential solution to detect such deception.
Moonshot AI has released Kimi K2.6, an open-weight model designed to compete with GPT-5.4 and Claude Opus 4.6, featuring the ability to run up to 300 agents in parallel.
Anthropic has released Claude Opus 4.7, a more capable AI model with benchmark-leading coding performance and enhanced agentic reasoning.
Anthropic's Claude Opus 4.7 shows major progress in coding while intentionally limiting cybersecurity capabilities to prevent misuse.
Anthropic has released Claude Opus 4.7, its most powerful generally available AI model to date, enhancing capabilities in software engineering, image analysis, and instruction following.
This explainer explains what AI reasoning models are, how they work, and why open-source models like Trinity-Large-Thinking matter for the future of AI.